首页> 外文OA文献 >The Responsibility Weighted Mahalanobis Kernel for Semi-Supervised Training of Support Vector Machines for Classification
【2h】

The Responsibility Weighted Mahalanobis Kernel for Semi-Supervised Training of Support Vector Machines for Classification

机译:半监督的责任加权马哈拉诺比斯核   支持向量机的分类训练

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Kernel functions in support vector machines (SVM) are needed to assess thesimilarity of input samples in order to classify these samples, for instance.Besides standard kernels such as Gaussian (i.e., radial basis function, RBF) orpolynomial kernels, there are also specific kernels tailored to considerstructure in the data for similarity assessment. In this article, we willcapture structure in data by means of probabilistic mixture density models, forexample Gaussian mixtures in the case of real-valued input spaces. From thedistance measures that are inherently contained in these models, e.g.,Mahalanobis distances in the case of Gaussian mixtures, we derive a new kernel,the responsibility weighted Mahalanobis (RWM) kernel. Basically, this kernelemphasizes the influence of model components from which any two samples thatare compared are assumed to originate (that is, the "responsible" modelcomponents). We will see that this kernel outperforms the RBF kernel and otherkernels capturing structure in data (such as the LAP kernel in Laplacian SVM)in many applications where partially labeled data are available, i.e., forsemi-supervised training of SVM. Other key advantages are that the RWM kernelcan easily be used with standard SVM implementations and training algorithmssuch as sequential minimal optimization, and heuristics known for theparametrization of RBF kernels in a C-SVM can easily be transferred to this newkernel. Properties of the RWM kernel are demonstrated with 20 benchmark datasets and an increasing percentage of labeled samples in the training data.
机译:例如,需要使用支持向量机(SVM)中的内核函数来评估输入样本的相似性,以便对这些样本进行分类。除了诸如高斯(即径向基函数,RBF)之类的标准内核或多项式内核之外,还存在特定的内核量身定制以考虑数据中的结构以进行相似性评估。在本文中,我们将通过概率混合密度模型(例如在实值输入空间的情况下为高斯混合)捕获数据中的结构。从这些模型中固有的距离度量(例如在高斯混合情况下的马哈拉诺比斯距离),我们得出了一个新的内核,即责任加权马哈拉诺比斯(RWM)内核。基本上,该内核强调模型组件的影响,假定要比较的两个样本都来自该模型组件(即“负责任的”模型组件)。我们将看到,在许多具有部分标记数据的应用程序(即SVM的半监督训练)中,该内核的性能优于RBF内核和其他捕获数据结构的内核(例如Laplacian SVM中的LAP内核)。其他关键优势是RWM内核可以轻松地与标准SVM实现和训练算法(例如顺序最小优化)一起使用,并且以C-SVM中的RBF内核参数化而闻名的启发式算法可以轻松地转移到该新内核中。 RWM内核的属性通过20个基准数据集和训练数据中标记的样本所占百分比的证明。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号